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The overconfidence, a well-established bias, in fact leads to unrealistic expectations or faulty assessment. So 
it remains puzzling why such psychology of self-deception is stabilized in human society. To investigate this 
problem, we draw lessons from evolutionary game theory which provides a theoretical framework to address 
the subtleties of cooperation among selfish individuals. Here we propose a spatial resource competition 
model showing that, counter-intuitively, moderate values rather than large values of resource-to-cost ratio 
boost overconfidence level most effectively. In contrast to theoretical results in infinite well-mixed 
populations, network plays a role both as a "catalyst" and a "depressant" in the spreading of overconfidence, 
especially when resource-to-cost ratio is in a certain range. Moreover, when bluffing is taken into 
consideration, overconfidence evolves to a higher level to counteract its detrimental effect, which may well 
explain the prosperity of this "erroneous" psychology. 

As is well known, confidence leads to success in a wide range of domains including business, job perform- 
ance, and competitions for champions 1-3 . Ironically, when confidence is relatively high, the whole human 
beings fall victim to overconfidence, which is considered to be the most "pervasive and potentially 
catastrophic" of all the cognitive biases by some psychologists 4,5 . Such exaggerated confidence is often blamed 
for labor strikes, wars, and stock market bubbles 6 ". But like a coin has two sides, some other authors hold 
completely different opinions, insisting on its advantages in boosting morale, ambition, resolve and persistence 
so as to increase the probability of success 912 . Regardless of different views, the prevalence of overconfidence is an 
undeniable fact. For example, by asking people how confident they are of specific beliefs, researchers find that 
confidence systematically exceeds accuracy, implying people are more sure that they are correct than they deserve 
to be 13,14 . Further observations also show that individuals incline to form overly positive judgment of their 
abilities, which is called "better-than-average effect" 15 . 

Overconfidence is a fascinating area of research, since it touches upon many different disciplines ranging from 
sociology to economics. Mounting effort have been devoted to unveiling the reasons for the pervasiveness of this 
faulty psychology. Prior work have mostly offered three explanations. The first posits a motivated bias: 
Individuals are driven to be overconfident because it provides them with psychological benefits 16 . The second 
explanation highlights the cognitive processes that may sometimes produce directional biases 17 . An additional 
possibility is that overconfidence enhances the person's social status 18 . Apart from the above results, Johnson and 
Fowler have presented an evolution model, demonstrating that overconfidence often prevails over accurate 
assessment as long as benefits from contested resources are sufficiently large compared with the cost of com- 
petition 12 . In fact, Johnson and his coauthors used standard assumptions about evolution dynamics 19 , under 
which whoever get larger fitness are more likely to survive, and better strategies will be imitated. 

Treating whether to be overconfident or not as a strategy, we can associate the study of this psychology with 
evolution game theory which was originally introduced as a tool for studying animal behavior 20,21 , but has become 
a general approach that transcends almost every aspect of evolutionary biology 22 . The traditional approach of 
evolutionary game theory uses deterministic dynamics to describe infinitely large, well-mixed populations 23,24 . To 
understand evolutionary game dynamics in finite-sized populations, a stochastic approach is developed 25 " 28 . 
Particularly, network has been found to be an important ingredient affecting evolutionary game dynamics. 
While pioneering works mainly focus on spatial games on regular lattices 27,29 " 37 , more studies have been expanded 
to general structured populations, such as small-world 38 42 ,scale-free networks 43 " 47 , and random networks 48,49 . 
Taking into account the conspicuous effect of population structure in evolutionary dynamics, one may then ask 
how psychologies or emotions transmit between interacting individuals who site on specific networks, and 
whether the transmission has profound significance. In fact, by reconsidering the concept of imitation, 
Szolnoki et al. find that copying emotions such as goodwill and envy instead of pure strategies from more 
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successful players can even resolve social dilemmas on structured 
populations, additionally pointing out that the emotional profile 
depends not only on the payoffs but also on the heterogeneity of 
the interaction network 50,51 . 

Inspired by all the previous works above, we propose a spatial 
resource competition model, simple but general, aiming at dealing 
with the puzzle how overconfidence spreads and fixes in structured 
populations. Square lattice prescribes that each individual competes 
with his 4 nearest neighbors to collect available resources. Then 
players are more likely to learn the psychology of the successful, 
despite their real capabilities. Different from the model in Ref 2 , 
bluffing behavior has been incorporated into our model. Although 
bluffing is often argued to be unstable in natural selection' 12 , it is 
usually not reliably distinguished from the true ability 52 . In particu- 
lar, successful disguises (bluffing) and real capacity need a much 
longer time scale to evolve than that of overconfidence, so they are 
treated as constant indices in our model. We will show in this study 
that, moderate values of resource-to-cost ratio rlc create a more 
favorable environment for the evolution of overconfidence in struc- 
tured population. More importantly, the intricate relationship 
between bluffing and overconfidence may be a potential driving force 
for the high level of overconfidence. The simulation and theoretical 
analysis details are described in the Methods. 

Results 

We begin by presenting overconfidence level f Q as a function of 
resource-to-cost ratio r/c while initial fractions of different types of 
individuals are equal (see Fig. 1(a)). It is shown that moderate values, 
rather than high values of r/c, boost f a effectively in structured popu- 
lations. Interestingly, regardless of the values of a. and /?, the plateau 
of high overconfidence level sharply drops to a lower platform once 
rlc exceeds the critical value 2.0. This can be attributed to the fact that 
considerable amount of resource stimulates everyone to make a 
claim actively, leading to intensive conflicts that help one to recog- 
nize his real capability. Moreover, the existence of bluffing plays a key 
role on the prosperity of overconfidence, especially when rlc is high. 
As is shown in Fig. 1(a), without bluffing (ft = 0),/ o is stabilized at a 
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much lower level than that when bluffing exists for rlc > 2.0. Players 
seem to be more apt to be overconfident when facing opponents who 
are not as powerful as they display, which accords with common 
beliefs. However, excessive overconfidence intensity might inhibit 
its spreading among the population. For instance, when a s 0.5 
one individual overestimates himself so seriously that he can even 
challenge opponents from higher rank of real capability (the rank gap 
is 0.5), and this choice never pays off. 

As a comparison, we also consider the evolution of overconfidence 
in well-mixed populations (see Fig. 1(b)), by means of extended 
replicator dynamic (see Methods for details). It is found that network 
structure works as a "double edged sword": When rlc is relatively 
small, it plays a role as "catalyst" to impel the spreading of over- 
confidence; When resource is rich (larger r/c), it acts as "depressant" 
to cool down the atmosphere. The primary cause lies in the fact that 
network produces a local "celebrity effect" 53,54 . Specifically, indivi- 
duals with the highest real capability get more payoff in each game 
round, and thus do not incline to change their overconfidence states. 
In networks, they act as hubs to influence their neighbors, forming 
non- overconfidence clusters to resist invasions (detailed explana- 
tions will be discussed later). Therefore no matter how large rlc is, 
the existing unbiased elites ensure that overconfidence cannot dom- 
inate the whole population. In contrast, in well-mixed populations 
individuals always imitate the "fittest" strategy globally, thereby x* is 
more sensitive to the values of rlc than f Q does, and x* will finally 
reach a fully overconfident state if resource is rich enough (r/c > 4.0) 
when bluffing exists. However, common traits in both structured and 
unstructured population exists that bluffing promotes overconfi- 
dence level (see Fig. 1 (a) and (b)). 

We next evaluate the impact of overconfidence intensity a and 
bluffing intensity fi on overconfidence level / 0 , as presented in Fig. 2. 
When r/c =1.5 (see Fig. 2(a)), f 0 sustains a high level over the whole 
parameter plane of a and /?. This confirms that moderate values of 
resource-to-cost ratio elevate overconfidence, which is robust to 
changes in overconfidence intensity and bluffing intensity. 
However, when rlc = 3 (a large resource-to-cost ratio, see 
Fig. 2(b)), distinction in/ D s with respect to a. and /J becomes more 
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Figure 1 | (a) Stable fraction of overconfident individuals f 0 in structured population and (b) equilibrium frequency of overconfident individuals 
x* in infinite well-mixed population as a function of resource-to-cost ratio rlc for different values of a and /?. Data presented in Panel (a) are obtained by 
means of Monte Carlo simulations, while in Panel (b) are obtained by means of extended replicator dynamic (see Methods for details). Hollow symbols in 
Panel (a) and dash line in Panel (b) correspond to cases where bluffing does not exist (/J = 0). Other parameters: f 1Q = 0.5, f B = 0.5. 
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Figure 2 | The 3-D plots showing the stationary fraction of overconfident individuals f 0 in dependence on both a and /J for (a) rlc = 1.5 and (b) r/c = 3. 

The minimum f Q is about 0.77 in Panel (a) showing that overconfidence is always favored under moderate values of r/c. When r/cis high, f a becomes more 
strongly dependent on the combination of i and ft. The maximum f 0 is reached when both a and ft are high, indicating that higher values of 
overconfidence is evolved in response to higher bluffing activities. 



conspicuous. The maximum f Q is reached within the region where 
large a is accompanied by large /?. Note that for any fixed value of /i, 
fo is always larger within the region above diagonal (i.e., a > /?) than 
below (i.e., a < fi). In other words, an overconfidence intensity 
slightly larger than bluffing intensity induces higher overconfidence 
level. In this sense, overconfidence is evolved as a tool to conquer the 
negative effect of others' bluffing. One should believe himself more 
than believing others 9 . 

We have further studied the impact of initial frequencies of over- 
confident individuals/ 0 and bluffing individuals/^ on the final over- 
confidence level / 0 , as presented in Fig. 3. It can be observed that at 
any given value oif IO , increasing/; will elevate f 0 . A larger f B means 
that more individuals display an exaggerated capability. When sur- 
rounded by more bluffing neighbors, one is more likely to be prof- 
itable if he is overconfident. Meanwhile, for any given f B , f Q also 
monotonously increases with/ ro - Unlike the influence of initial fre- 
quency on the evolution of cooperation in structured population 55 , 
we find that the system always converges to a higher overconfidence 
level compared to the initial one (i.e.,/ 0 > fio), showing that over- 
confidence is constantly favored by natural selection once bluffing 
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Figure 3 | Influence of initial overconfidence level f IO and initial bluffing 
level f B on stationary fraction of overconfident individuals f 0 in the 
contour form. Parameters are: y. = 0.4, ft = 0.3, and rlc = 2.5. Similar 
results can be obtained whenever i > ft and rlc > 2.0. Overall, larger f IO 
corresponds to higher f 0 . For any given value of f IO , increasing f B further 
promotes / 0 , showing that overconfidence is favored when bluffing exists. 



exists. Our investigation shows that the prosperity of bluffing stimu- 
lates overconfidence to prevail, given that a> p and rlc > 2.0. 

For better understanding the influence of network structure on the 
evolution of overconfidence, we present in Fig. 4 some typical snap- 
shots of strategy patterns in time evolution. It can be observed that, 
regardless of the different initial distributions, overconfidence 
spreads quickly and flourishes. Individuals who have high real ability 
(H), considered as "elites" here, play decisive roles in guiding social 
mentality, and thus they become the core of the clusters. As evolution 
proceeds, the number of overconfident elites (blue) increases, indi- 
cating that they have an advantage over the unbiased elites (yellow). 
However, the latter can also take advantage of the network structure 
to form some solid clusters (yellow + red) to resist invasion, which 
explains why the unbiased ones will not die out in the long run in 
structured populations. Notably, the overconfidence clusters (blue + 
green) tend to connect with each other, forming larger ones to 
expand their territory, especially when initial fraction of overconfi- 
dence is relatively high (f IO = 0.3). Note that besides network reci- 
procity 29-56,57 , local "celebrity effect" also plays important roles. 
Individuals are susceptible to neighbor elites' psychology. These fol- 
lowers of overconfident or unbiased elites work as "buffer" in 
between, preventing the direct confrontation of two types of elites. 

Moreover, who benefit more from such a resource competition 
game draws our attention, and thus we compare the average payoffs 
of different types of individuals in Fig. 5. Firstly, real ability is fatal in 
deciding one's faith in resource competition. The higher one's real 
ability is, the higher average payoff one acquires. Just as illustrated in 
Fig. 5 (a), there exist clear-cut gaps between income hierarchies, and 
the lower level can never outcompete the upper one, no matter how 
the initial frequencies of bluffing and overconfident individuals are. 
Secondly, whether bluffing ones perform much better than unbiased 
ones also depends on their real abilities. The low capability bluffing 
ones have the whip hand over those who do not bluff, especially when 
bluffing and overconfidence are rare initially (see the left panel of 
Fig. 5(b)). While for middle and high ability, the advantage of bluff- 
ing is relatively low (see the middle and right panel of Fig. 5 (b)). 
Common for individuals of any level of real ability, bluffing is more 
effective when the others do not bluff and meanwhile do not develop 
the overconfidence, the useful tool countervailing bluffing. 

Furthermore, it is worth studying the situation not only overcon- 
fidence state (OS), but also bluffing state (BS) could be subject to 
evolution process. In other words, we consider an extended model 
that individuals can mimic any feature of others except the real 
ability. Different from the case with fixed bluffing level (see Fig. 1), 
where f a suddenly drops once rlc > 2.0, here f Q monotonously 
increases with r/c (see Fig. 6). And in the latter case,/ Q is higher than 
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Figure 4 | Typical snapshots of spatial patterns formed by different types of individuals under initial overconfidence levels r}o = 0.1 ((a)-(c)) and 
f} 0 = 0.3 ((d)-(f)) at different time steps. Overconfident ones are colored blue (high real ability) and green (middle and low real ability). Unbiased ones 
are colored yellow (high real ability) and red (middle and low real ability). The stationary fractions of overconfident ones are 42.5% for/ ;Cl = 0.1 and 
62.6% for f IO = 0.3. In structured population, high real ability but unbiased individuals form clusters to resist the invasion by overconfident ones, 
explaining the survival of unbiased ones even with high rlc. The size of the square lattices is 100 X 100. Other parameters: a. = 0.4, ji = 0.3, and rlc = 3. 



the former. Meanwhile, f B evolves to a higher value relative to the 
initial one (0.5) when rlc > 2. And^g is further promoted with larger 
bluffing intensity ft (see Fig. 6(b)). This may provide an explanation 
why/ 0 evolves to a higher level and does not decrease when rlc > 2. 
Just like what we have shown in the scenario with fixed f B (see Fig. 3), 
a larger percentage of bluffing individuals induces higher overconfi- 
dence level. It seems that overconfidence may gain more popularity if 
the skill of bluffing can be learned easily. 

Discussion 

In summary, we have investigated the impact of spatial structure and 
bluffing on the evolution of overconfidence in the resource competi- 
tion game. In contrast to previous results in well-mixed popula- 
tions 12 , the stable state is no longer composed of monomorphic 
overconfident individuals when the resource-to-cost ratio rlc is suf- 
ficiently large. Overconfidence never dominates the whole popu- 
lation and is more adaptive for moderate rlc. This is partially 
ascribed to the local celebrity effect caused by the population struc- 
ture and real ability hierarchies. Furthermore, a potential benefit far 



exceeding the cost of competition stimulates more conflicts 12,58 , 
which in turn depresses the psychology of self-deception, i.e., the 
overconfidence. 

The existence of bluffing behavior usually leads to ambiguity in 
one's perception about other's real ability. Our study shows that 
bluffing promotes overconfidence. Overconfidence is beneficial for 
individuals in an environment where bluffing is prevalent, because it 
makes them dare claim and compete for the resource, which would 
not happen without such "erroneous self-deception". Therefore 
overconfidence is just like a tool to conquer the negative effect of 
bluffing. This conclusion is robust to initial distributions or intensity 
of overconfidence and bluffing. Furthermore, the advantage of bluff- 
ing is more significant within communities of relatively low real 
abilities. If bluffing can be learned by others, it will become popular 
among individuals, further promoting overconfidence level. 

All in all, the spreading of overconfidence is investigated in a more 
realistic model, and we propose that the existence of bluffing is the 
key factor for the prosperity of the bias towards exaggerated personal 
qualities and capabilities. We hope that these observations could 




Figure 5 | Comparison of the payoffs of individuals with different real abilities, (a) The 3-D plot showing the normalized average accumulated payoffs of 
individuals with high (H, crimson), middle (M, light green), and low(L, blue) real abilities respectively, as a function off IO and^. H ranges from 0.9350 to 
1; M ranges from 0.4859 to 0.5521; L ranges from 0.0443 to 0.1065. (b) Average payoff ratio between the individuals who bluff and those who do not 
(R = Payoj fbiufllPayo) j f rwn -biuff) as a function of/ /0 andf B , when their real abilities are low (R 1 ), middle (R M ), and high(R H ) respectively. R L ranges 
from 1.03 to 8.62; R M ranges from 1.01 to 1.634; R H ranges from 1.002 to 1.322. Data point is obtained by averaging over a period of 10000 time steps, and 
we have checked that longer time period does not qualitatively change the results. Other parameters: a = 0.4, /J = 0.3, and rlc = 3. 
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Figure 6 | Stable fractions of overconfident individuals f 0 and bluffing individuals f B as a function of r/c for different values of a and /f, when 
players can learn both the overconfidence state and the bluffing state of others. In Panel (a), a — 0.4 and ft — 0.3. Bluffing prevails once r/c exceeds the 
critical value 2.0. Overconfidence does not drop to lower level with increasing r/c and remains above 0.89. In Panel (b), a — 0.7 and — 0.6. Bluffing 
almost dominate the whole range of r/c. Overconfidence stays above 0.98 for r/c > 0.5. Other parameters: f }0 — 0.5, and f IB — 0.5. 



provide some insight into understanding this widespread erroneous 
psychology in human society. 

Methods 

Mathematical model. Within this work, the basic model is a resource competition 
game (RCG). Without loss of generality, an individual i in our model is characterized 
by real capability overconfidence intensity a, and bluffing intensity /?, where 0 f £ 
© = {1, 1.5, 2}, corresponding to low (L), middle (M), and high (H) real capabilities, 
respectively. The value of a represents the perception error of a player about his own 
ability when he is overconfident, while /? represents how much stronger a player 
displays than he really is. Whoever dare compete for resource is at least confident, so 
we assume no self-abasement in this model. In a RCG, i belives he owns a "self- 
perceived capability" /c, as: 



kj = Qj + UjOL 



(1) 



where Uj = 1 if i is overconfident and 0 otherwise. He also exhibits his "displaying 
capability" m ; as: 

mi = 0i + VijS (2) 

where v ; — 1 if i bluffs and 0 otherwise. Supposing a resource r is potentially available 
to individuals that claim it, and two players i and j both can claim this resource, a RCG 
takes place. If neither individual claims the resource, then no fitness is gained. If only 
one individual makes a claim, then it acquires the resource and gains fitness r and the 
other individual gains nothing. If both individuals claim the resource, then each 
individual pays a cost c due to the conflict between them, and the results depend on 
their real capabilities. Therein player i faced with j gains a payoff Pa, which can be 
calculated as: 

• (1) If ki > ntj and h < m jt i claims but j does not, and thus P# = r. 

• (2) If ki < rrij, i will not claim, and Pa — 0. 

• (3) If ki > rrij and h > ffi/, a conflict happens between i and j, whose real 
capabilities determine what they can get: If 8, > dp P» = r — c; If ft, < £L 
Py — — c, If Bj — dp Pij = r/2 — c, i.e., r is equally distributed between them after 
a battle. 

• (4) If ki — mi, the relation between r and c should be considered: If r ^ 2c, i will not 
claim because the upcoming conflict promises no positive benefit, and thus Pa = 
0; If r > 2c and kj < m„ i gains the whole resource without any payout, and P« = r; 
If r > 2c and kj ^ m„ the clash is inevitable, and therefore P« can be calculated as 
in case (3). 

Players are located on the vertices of L X L square lattices with periodic boundary 
conditions. Capability rank and bluffing state (BS) are considered to be natural 
instincts, and thus are constant throughout the evolution. Individuals can only 
imitate overconfidence states (OS) of others. In an extended model we also explore the 
case where individuals can mimic both OS and BS of others. f IB denotes the fraction of 
individuals that bluff. Monte Carlo simulations of the game are carried out com- 
prising two elementary steps. Firstly, each player i collects its payoff P, by interacting 
with its four nearest neighbors through RCG. The payoff of player i can be calculated 



(3) 



where Q^-j represents players in i's neighborhood. Secondly, players consider chan- 
ging their OS synchronously. Specifically, player i adopts the OS of the randomly 
selected neighbor j with the probability 



TIP, 



-P.) = 



\+exp[(P t -P,)/K] 



(4) 



where K characterizes the level of uncertainty in strategy adoption 59 . Without loss of 
generality we use K = 0.1, and we have confirmed that different values of K do not 
qualitatively change the simulation results. 

Extended replicator dynamic. For comparison, evolutionary dynamics of 
overconfidence in well-mixed population is also investigated. In replicator 
dynamics, a differential equation governs the evolution of the densities of different 
strategies, lending to the notion of evolutionary stability 23 ' 60,61 . Further, 
enlightened by Ref 62 , we can consider the whole population to be specially 
community- structured 63-68 , depending on different character profiles of 
individuals. There are 6 character profiles, specified by a 2-bit feature name [a^] 
where a t £ [L, M, H] indicates the capability level of this individual is low, 
middle, or high, while a 2 £ [B, N] indicates whether he bluffs or not, indeed 
forming 6 communities. Players imitate overconfidence states of others but cannot 
change their characters, and thus the size of each community is constant and the 
members cannot migrate among communities. Strategies A {or B) indicates that 
an individual is overconfident (or not), while and y { (i = LB, LN, MB, MN, HB, 
HN) denote the fractions of individuals adopting strategy A and B in community i 
respectively. Here Xj + y { — 1. We use p,- to denote the fraction of community i 
members in the whole population, and then / ], pj= 1- Hence any individual 
interacts with a player from community i with probability p, according to the 
well-mixed assumption. In each community, the standard replicator equation 
takes the form of: 



Xi=Xi(f\ Ai -ni) 

yi=yt{nBi-Wd 



(i = LB, LN, MB, MN, HB, HN) 



(5) 



where i] Ai and are the fitness of overconfident and unbiased people in 
community i respectively, and Tfj=x,rj Ai -\-yit] Bi is the average fitness of individuals 
in community i. By solving above equations, we can obtain x*, the stable fixed 
state of overconfidence (SFSO) of community i. Hence we can derive the SFSO of 
the entire system as 

Assume that individuals are randomly distributed among the communities, i.e., 
for each community i{i = LB, LN, MB, MN, HB, HN), p { = 1/6, and that each one 
has equal probability 0.5 to be overconfident or not. Thus initial overconfidence 
level for each community is xf 1 — 1/12. Taking a = 0.4 and /? = 0.3 as an 
example, we can obtain the dynamic equations as: 
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(xhb + x H n) + 



3J (7) 



X ai N = Xa l N{l— I I2 _C J *g 



(ai=L,M,H) 



The stable equilibrium point of the above differential equations, i.e., the SFSO, is: 

{r<2c) 

r + 2c y 1 (8) 
1 (r>2c) 



Different SFSOs with respect to other overconfidence and bluffing intensities are 
shown in the main text. 
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